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Reraarks 

In an Office action mailed October 13, 2006, claims 
1-7 were rejected under 35 U.S.C. § 101 on grounds that the 
claimed invention is directed to non- statutory subject matter. 

The Examiner incorrectly stated that the claims 
recite a method for performing a mathematical function. 
Instead, the claims actually recite "a method of operating an 
multiplication circuit... . " Thus, the claims are not directed 
to performing a mathematical function per se, but rather to a 
specific,, practical and concrete application in which hardware 
circuitry ("multiplication circuit", "accumulator", 
"registers", and "memory") is directed in a sequence of steps 
involving memory accesses ("loading" and "storing") and 
operations by a hardware multiplication circuit with 
accumulator ("multiplying" and ""adding") that together allow 
the circuitry as a whole to "compute the product of two 
operands". The scope of the claims is, therefore, understood 
in this real world context of operating the multiplication 
circuit, with access to memory, according to the particular 
sequence of steps set forth in the claims. It does not 
foreclose all use of the underlying mathematical principle or 
algorithm in the abstract, whether by hand or by machine 
generally, but only as it relates to the specified concrete 
operation of a multiplication circuit as claimed. 

The Examiner also asserts that "since there is no 
physical transform to establish a practical application, a 
useful, concrete and tangible result appears to be lacking." 
This is incorrect in both respects. There are numerous 
physical transformations occurring in the method, as claimed. 
The loading of word- wide operand segments from memory into the 
at least two registers (RX and RY) of the multiplication 
circuit involves a physical change of the electronic states of 
those registers every time a segment is loaded. The act of 
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multiplying necessarily involves physical transformations of 
the electronic states within the hardware of the 
multiplication circuit, otherwise, no outputs ( "two- word-wide 
intermediate products") would be obtained. The adding of 
intermediate products of the same weight in an accumulator 
likewise necessarily involves physical transformations of the 
electronic states with the hardware of the accumulator and 
wells as physical changes of the electronic states of the to- 
ward input register (R2) and two-word output register (RR) 
associated with that accumulator. The storing of accumulated 
results from the output register (RR) back into the memory 
involves a physical change of the electronic states of the 
memory. The particular results physically stored within that 
memory are functionally related to the operations of the 
multiplication circuit. 

The claimed method implemented in the 
multiplication circuit hardware provides a useful, concrete 
and tangible result possessing real world value. The 
multiplication circuit performs its claimed computation with 
far fewer memory accesses over that of other multiplication 
circuits (including the three cited in the Office action) . It 
provides a tangible result that can be stored physically in 
memory, which can then be accessed for further use. One 
exemplary use for the claimed method mentioned in the 
specification is in cryptography. Cryptographic applications 
generally require that computations be performed upon very 
wide operands for adequate security (typically at least 256 
bits for syrameti-.ic crypt osystems or at least 1024 bits for 
public-key cryptosystems) , much wider than the physical 
computation hardware, where the message blocks, cryptographic 
keys, and other cryptographic data are treated by the hardware 
performing the cryptographic application as if they were 
numeric values. Cryptographic applications Lend to be so 
computationally intensive, involving, for example, polynomial 
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arithmetic upon very large integer values, that any 
improvement in efficiency and speed of the computations have 
very real utility. The present invention offers such an 
improvement, since each memory access, whether to load an 
operand segment or store accumulated results, requires one 
cycle of time over and above the time required for the 
multiplying and accumulating by the multiplication circuit and 
accumulator hardware. Fewer memory accesses (a physical 
operation) , means a more efficient and faster computation 
overall by the hardware. 

The rejection under 35 U.S.C. § 101 is believed to 
be traversed. 

Obviousness Rejections 

Claims 1-8 were rejected under 35 U.S. C. § 103(a) 
as being unpatentable over Chevillat et al . (IBM Tech. Disc. 
B-ll.}. The Chevillat document was cited for disclosing a 
12x12 multiplier unit that can carry out a 12x16 
multiplication operation ("extended precision") . This means 
that at least one operand, namely the 16 -bit multiplicand, is 
wider than the multiplication circuit. The operation is split 
into two steps. In the first step, the 12 most significant 
bits of the 16-bit multiplicand are received (by the 12x12 
multiplier unit) through the B register. In the second step, 
the four remaining least significant bits of the multiplicand 
are received through the B register, right aligned with 8 
leading zeros. In both the first and second steps, the 12-bit 
multiplier is received through the A register. The partial 
result of the first step is stored in the accumulator. The 
result of the second partial multiplication is appropriately 
scaled by a shifter at the multiplier output, then added to 
the first partial result to obtain a final result in the 
accumulator. 



The Office action asserts that the multiplication 
circuit of Chevillat et ai. comprises: multiplier and 
multiplicand registers {these are the A and B registers) , a 
multiplier unit, ALU and accumulator. These features are 
clearly seen in the drawing figure on the front of the cited 
document. Further, the Office action notes that Chevillat et 
al . do not show that the accumulator has a size of u three 
words plus a number of carry bits", as recited in independent 
claims 1 and 8 of the application, but argues that it would 
have been obvious to use a accumulator having sufficient size 
to accommodate that desired final result of the multiplication 
circuit. Notwithstanding all of those statements, Applicants 
assert that the claimed invention would not have been obvious 
because Chevillat et al . fail to disclose or suggest the 
claimed "specified order" or "sequence" defined in the last 
indented section of method claim 1, nor a "operations 
sequencer" controlling that "sequence" as defined in the last 
two sections of claim 8. 

Applicants' multiplication circuit-implemented 
method and corresponding multiplication circuit with specified 
operations sequencer is provided to enable computation of very 
large products of two operands (claim X- "substantially wider 
than the multiplication circuit" and "each of the operands 
composed of a plurality of contiguous word-wide operand 
segments") , much larger than a mere 16x12 bit (1 % words by I 
word) multiplication and much larger than any accumulator 
could wholly accommodate. Even an operation as small as 
multiplying 8 words by 8 words operands, provided as an 
example in Figs. 4 through 5c of the present application, 
would require at least a wide 16-word accumulator (plus 
additional carry bits) if it were to be implemented as 
suggested in Chevillat et al. Many of the operations enabled 
by the present invention are ever larger (page 11,. line 34 - 
page 12, line 3) . Instead of widening the accumulator to 
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accommodate an entire product result, the accumulator is given 
a size only wide enough (claims 1 and 8:: "output of three 
words plus a number of carry bits") to accumulate intermediate 
products in "groups of two adjacent product weights" that are 
obtained in the defined sequence controlled by the operations 
sequencer. Because only intermediate products are added in 
the accumulator, not an entire product, accumulated results 
are stored back into memory "after accumulating all 
intermediate products of the specified weight", as set forth 
in our claims. None of this is described or suggested by the 
cited document. 

Claims 1-8 were also rejected under 3 5 U.S.C. § 
103(a) as being unpatentable over New et al. (U.S. Pat. No. 
4,809,212). New et al . was cited for disclosing a 32x32 
multiplication circuit having 64-bit operands wider than the 
multiplication circuit ( "double precision" ) , As noted in the 
Office action the multiplication circuit comprises multiplier 
and multiplicand registers {102, 106, 132, 136), a multiplier 
array (120), and an accumulator (adder 152, shifter 160 and 
register 166) . It is also noted that New et al . do not show 
the accumulator having a size of three words plus a number of 
carry bits, as set forth in Applicants' claims - the 
accumulator of New et al . being 67-bit wide, i.e., two words 
plus three bits. 

New et al . fail to disclose or suggest "loading 
word-wide operand segments of the two operands in. a specified 
order from the memory into the multiplication circuit, 
wherein the specified order for loading operand segments into 
said registers is a sequence defined by the intermediate 
product weights", as specified in Applicants' claim 1, nor the 
corresponding "operations sequencer" and its defined sequence 
specified in Applicants' claim 8. Indeed, New et al . has data 
registers for holding all of the 32-bit portions of the 64-bit 
double-precision operands, i.e., registers XA and XB for the 
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rnultiplicand and registers YA and YB for the multiplier. 
Thus, it does not really matter to New et al . ' s multiplication 
circuit in what order the. operand portions are loaded into the 
registers . 

in contrast, the "substantially wider" {claim 1) 
operands in the present invention could not be readily stored 
in registers all at the same time. For example, the 
relatively modest size 8-word by 8-word example, given in Figs. 
4 through 5c of the present application would require a total 
of 16 distinct data input registers (8 for each operand) if 
applied to the circuit construction described by New et al . 
The even larger multiplications that are possible with the 
present invention {page 11, line 34 to page 12, line 3) could 
not be so accommodated; they would require far too many 
registers. Instead, of having a separate data register for 
every operand segment, the present invention makes do with 
fewer data registers (e.g., RX, RY, RZ, RR in Fig. 3) for 
"temporarily holding" (claim 1) the operand segments, loading 
the operand segments vv in a specified order" (claim 1) from the 
memory. (See page 8, lines 1-3, page 8, line 14 - page 9, line 
5., and the description of the specified order or operation 
sequence beginning on page 10 . ) 

The present invention as claimed not only adds 
intermediate products of the same weight in an accumulator, 
but also stores accumulated results "back into said memory at 
least after accumulating all intermediate products of the 
specified weight" . The New et al . patent teaches away from 
this feature (see, e.g., col. 1, lines 25-37 and 60-68) by 
noting the slowness of off -chip data transfers. New et al . 
are able to avoid the need for external storage of temporary 
results or for any off-chip intermediate data transfers since 
they are performing a mere double-precision operation. 
Because at least one of the operands is substantially wider 
than the multiplication circuit in the present invention, 
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storage of temporary results (the accumulated intermediate 
products of the specified weight) back into memory cannot be 
avoided, but is minimized by the claimed feature of doing the 
multiplying step "in successive groups of two adjacent product 
weights" . This is not taught nor suggested by the cited New 
et al . patent . 

Claims 1-8 were also rejected under 35 U.S. C. § 
103(a) as being unpatentable over Rim (U.S. Pat. No. 
5,920,497) . Again, like New et al. this is a method and 
apparatus for performing "double precision" multiplication 
operations. As the operands are only two words wide, they can 
be completely stored in just four registers (LI, LO, Rl, and 
R0) . Again, the entire product can be accumulated without 
having to store intermediate results in external memory. The 
same features of the present invention that were noted in 
relative to Chevillat et al . and New et al . are also neither 
taught nor suggested by Rim. 
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Conclusion 

In light of the technical features already claimed, 
the invention is deemed, to be non-obvious and patentable over 
the cited prior art. Applicants request reconsideration of 
the claims in view of the arguments made herein. A Notice of 
Allowance is earnestly solicited. 
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